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Abstract 

The AP0BEC3 deoxycytidine deaminase family functions as liost restriction factors that can block replication of Vif (virus 
infectivity factor) deficient HIV-1 virions to differing degrees by deaminating cytosines to uracils in single-stranded (-)HIV-I 
DNA. Upon replication of the (-)DNA to (-I-)DNA, the HIV-1 reverse transcriptase incorporates adenines opposite the uracils, 
thereby inducing C/G^T/A mutations that can functionally inactivate HIV-1. Although both AP0BEC3F and AP0BEC3G are 
expressed in cell types HIV-1 infects and are suppressed by Vif, there has been no prior biochemical analysis of AP0BEC3F, in 
contrast to AP0BEC3G. Using synthetic DNA substrates, we characterized AP0BEC3F and found that similar to AP0BEC3G; it 
is a processive enzyme and can deaminate at least two cytosines in a single enzyme-substrate encounter. However, 
AP0BEC3F scanning movement is distinct from AP0BEC3G, and relies on jumping rather than both jumping and sliding. 
AP0BEC3F jumping movements were also different from AP0BEC3G. The lack of sliding movement from AP0BEC3F is due 
to an ^^°NPM^^^ motif, since insertion of this motif into AP0BEC3G decreases its sliding movements. The AP0BEC3G NPM 
mutant induced significantly less mutations in comparison to wild-type AP0BEC3G in an in vitro model HIV-1 replication 
assay and single-cycle infectivity assay, indicating that differences in DNA scanning were relevant to restriction of HIV-1. 
Conversely, mutation of the AP0BEC3F ^^Vro to ^^^Gly enables AP0BEC3F sliding movements to occur. Although 
AP0BEC3F ^^"NGM^^^ could slide, the enzyme did not induce more mutagenesis than wild-type AP0BEC3F, demonstrating 
that the unique jumping mechanism of AP0BEC3F abrogates the influence of sliding on mutagenesis. Overall, we 
demonstrate key differences in the impact of AP0BEC3F- and AP0BEC3G-induced mutagenesis on HIV-1 that supports a 
model in which both the processive DNA scanning mechanism and preferred deamination motif (AP0BEC3F, 5'TTC; 
AP0BEC3G 5'CCC) influences the mutagenic and gene inactivation potential of an AP0BEC3 enzyme. 

Citation: Ara A, Love RP, Chelico L (2014) Different Mutagenic Potential of HIV-1 Restriction Factors AP0BEC3G and AP0BEC3F Is Determined by Distinct Single- 
Stranded DNA Scanning Mechanisms. PLoS Pathog 10(3): e1004024. doi:10.1371/journal.ppat.l004024 

Editor: Christopher Aiken, Vanderbilt University School of Medicine, United States of America 

Received November 8, 2013; Accepted February 9, 2014; Published March 20, 2014 

Copyright: © 2014 Ara et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted 
use, distribution, and reproduction in any medium, provided the original author and source are credited. 

Funding: This work was supported by a Natural Sciences and Engineering Research Council of Canada Discovery Grant (http://www.nserc-crsng.gc.ca/index_eng. 
asp} and a Canadian Institutes of Health Research Operating Grant, H0P1 1 1407 (http://www.cihr-irsc.gc.ca/e/1 93.html). The funders had no role in study design, 
data collection and analysis, decision to publish, or preparation of the manuscript. 

Competing interests: The authors have declared that no competing interests exist. 

* E-mail: linda.chelico@usask.ca 



introduction 

APOBEC3F (A3F) and APOBEC3G (A3G) are members of a 
family of seven single-stranded (ss)DNA cytosine deaminases (A3 A, 
A3B, A3C, A3D, A3F, A3G, and A3H) [1] and play a role in 
restriction of the retrovirus HIV-1 (referred to as HIV) [2]. 
Research has been highly focused on primarily A3G and 
secondarily A3F for a number of years since they appeared to 
be the most efficient restrictors of HIV replication [3,4,5,6,7]. 
Although there are documented restrictive effects of A3G, and 
possibly A3F, at an individual level (reviewed in [8]), the 
suppression of HIV by A3G and A3F at a population level is 
lost due to the HIV protein Vif (viral infectivity factor) [6,9] . Vif 
forms an E3 ubiquitin ligase with host proteins and causes A3G 
and A3F polyubiquitination and degradation through the protea- 
some [6,10,11,12,13,14]. 

The general mechanism by which A3G restricts HIV, which has 
been a paradigm for other A3 enzymes, requires that it be 
encapsidated with the ribonucleoprotein complex of HIV [9, 1 5] . 
A3G requires its N-terminal domain (NTD), which can bind 



nucleic acids, for encapsidation into virions [16]. A3G catalyzes 
deaminations through its C-terminal deaminase domain (CTD) 
[16,17]. In the target cells that these virions infect, encapsidated 
A3G can deaminate cytosines to uracils in (— )DNA reverse 
transcribed from the RNA genome, after the reverse transcriptase 
associated RNaseH activity enables ssDNA regions on the 
(— )DNA to be accessed by the enzyme [18,19]. The uracils in 
the (— )DNA are used as a template by reverse transcriptase during 
(-l-)DNA synthesis and result in guanine to adenine mutations. If 
A3G can induce sufficient numbers of these mutations, the 
resulting proviral DNA will be functionally inactivated. The 
deaminases A3D, A3F, and A3H appear to follow this general 
mechanism of restriction in cell culture, but to differing degrees 
than A3G [3,4,20,21,22,23,24]. The exceptions are A3A, which 
inhibits incoming HIV viral particles in myeloid lineage cells 
[25,26], A3C, which does not appear to become encapsidated or 
restrict HIV in cell culture [20,27], and A3B, which can restrict 
HIV in 293T and HeLa cells, but not SupTl ceUs [20,28]. 

Despite a possible role for ASF, A3D and A3H Haplotype II in 
HIV restriction, it appears that A3G is more effective at restricting 
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Author Summary 

Human cells possess a family of seven DNA-modification 
enzymes, termed AP0BEC3, that function as part of our 
innate immune system. The enzymes modify cytosine in 
DNA which induces mutations. There are particular 
enzymes, AP0BEC3D, AP0BEC3F, AP0BEC3G and APO- 
BEC3H, that appear to be most relevant to restricting HIV-1 
replication in CD4+ T cells using this mutagenic mecha- 
nism, if they can avoid degradation that is induced by the 
HIV-1 protein Vif. There has been little biochemical analysis 
of AP0BEC3 enzymes other than AP0BEC3G in terms of 
the mechanism by which these enzymes search DNA for 
target cytosines to deaminate. We conducted a biochem- 
ical analysis of AP0BEC3F. We found that while AP0BEC3G 
uses 1 -dimensional sliding and 3-dimensional transloca- 
tions, AP0BEC3F is restricted to 3-dimensional transloca- 
tions. This makes the searching mechanism of AP0BEC3F 
superficial and detrimental to the induction of a large 
number of mutations. In addition, gene inactivation was 
less likely to occur upon deamination of the target motif of 
AP0BEC3F {5'TTC), in comparison to the target motif of 
AP0BEC3G (5'CCC). All together the data support a model 
in which the way these enzymes scan DNA can predict the 
magnitude of mutagenesis induced and the target motif 
can predict ability to cause gene inactivation. 

HIV replication and tliat perliaps the other A3 enzymes function 
in a collaborative way with A3G [20,2 1,22,23,24,29] . In particular 
there has been a recent focus on the restriction capability of ASF. 
A3F was initially identified as potentially being an equal 
contributor with A3G to the restriction of HIV [3,5,6,7,30], but 
current research demonstrates, in agreement with an earlier report 
[4], that A3F may have less antiviral activity than A3G 
[22,23,24,29]. Many difiFerent experimental protocols, such as 
analysis of stably expressed ASF from a cell line [23], use of 
primary cell lines [24], and ASF haplotypes from donor samples 
[22] have been applied and demonstrate that ASF has less of an 
effect on HIV infectivity in comparison to A3G. However, another 
report showed no difFerence in restriction efficiency of A3G and 
ASF beyond 2-fold using experiments that knocked-down endog- 
enous AS expression in a nonpermissive cell line [21]. As a result, 
the role of ASF in restriction of HIV remains unclear. 

Among reports demonstrating less of an affect of ASF in 
restricting HIV replication than A3G, there is stiU no identified 
reason for why this may occur. From some reports ASF mRNA is 
expressed 10-fold [31] or 5-fold [22] less than ASG mRNA, 
suggesting less ASF would become virion encapsidated. However 
another report found ASF and ASG mRNA expression was more 
comparable [32]. Further, some reports have found a direct 
correlation with mRNA and protein levels [31,32] whereas other 
reports have been unable to make such a correlation due to the use 
of different primary antibodies [24] . Confounding the interpreta- 
tion of these data are reports which demonstrated that ASF is 
preferentially encapsidated with the HIV ribonucleoprotein 
complex in comparison to ASG [4,33]. Song et al. concluded that 
the encapsidation difference between ASG and ASF in effect 
absolves any difference in cellular expression [33]. Despite this 
observed more specific packaging of ASF in the ribonucleoprotein 
complex [33], studies have found a minimal contribution of ASF 
to the hypermutation of HIV genomes or less potency in HIV- 1 
restriction [4,22,23,24,29]. Together these data suggest that if 
there is a difference in restriction efficiency of ASF and ASG, that 
it is not the physiological conditions which cause different effects 
on HIV infectivity, but an inherent difference in their biochemical 



characteristics. However, there has been no in depth biochemical 
characterization of ASF to date to determine what might be these 
differences between ASG and ASF. As such, we have undertaken a 
characterization of ASF in comparison to ASG to identify an 
underlying biochemical reason for these observations. 

In particular, we have focused on characterizing the mechanism 
ASF uses to scan ssDNA. This is because it has been shown that 
the ssDNA scanning mechanism of ASG is important for inducing 
mutagenesis of (— )DNA formed during reverse transcription of 
RNA [34]. ASG has been characterized to scan ssDNA through 
facilitated diffusion [35,36,37]. Facilitated diffusion is a 3- 
dimensional scan of DNA by enzymes to locate their target sites 
for catalysis [38,39,40]. The movement is characterized by sliding, 
jumping or intersegmental transfer motions. Sliding is used to 
describe short range 1 -dimensional scanning motions and can 
enable an in depth search of a particular area of DNA for a target 
motif [39,40] . Jumping is a term that describes microdissociations 
of the enzyme from the DNA with a reassociation on the same 
DNA substrate, i.e., the enzyme does not diffuse into the bulk 
solution [38,41]. The negative charge of the DNA establishes a 
charged radius around the DNA molecule in which a positively 
charged enzyme can dissociate, diffuse and still return back to the 
same DNA. These jumping events enable enzymes to translocate 
larger distances than sliding thus making the search of non-target 
DNA more efficient than sliding alone [39,40]. Intersegmental 
transfer is similar to jumping but describes a movement where an 
enzyme with two DNA binding domains interacts with two distal 
sites simultaneously before dissociating from one of the sites 
[39,40]. Different research groups, including our own, have found 
ASG to use a combined sliding and jumping search mechanism 
[34,35,37,42], although one report found ASG to use interseg- 
mental transfer [36]. We have characterized ASG mutants and 
ASG in complex with different Vif variants that resulted in 
decreases of either sliding or jumping motions and found that the 
ability of these ASG forms to induce mutagenesis of nascendy 
reverse transcribed DNA was decreased [34,43]. We have 
hypothesized that both sliding and jumping are important for 
inducing mutagenesis because ASG needs to conduct local 
searches (sliding) to effectively deaminate many cytosines, ensuring 
gene inactivation, and also translocate (jumping) over RNA/DNA 
hybrids to reach distal regions of (— )DNA [34]. The processive 
scanning of other A3 enzymes has not been reported, except ASA 
which was found to be largely nonprocessive [44]. 

This work is the first biochemical characterization of ASF and 
provides a biochemical explanation for the lowered ability of ASF 
to inactivate HIV, as reported by numerous research groups 
[22,23,24,29] and within this report. We have found that ASF 
primarily uses jumping movements to scan ssDNA which is 
detrimental to its ability to cause numerous mutations on (— )DNA 
during reverse transcription. The target motif of ASG (5'CCC) 
also appears to cause more inactivating mutations in the HIV 
protease [pwi) than the target motif for ASF (5'TTC), adding 
another level of deficiency in HIV inactivation potential. AU 
together our data provide a model for the specific biochemical 
properties required for efficient restriction of HIV by A3 
deaminases. 

Results 

ASF and ASG distinctively scan ssDNA 

The processive nature of ASG has been shown to be of 
importance for inducing mutagenesis of HIV (— )DNA in a model 
in vitro system [34,43] and in cell culture [45]. It is not known 
whether ASF is processive. Since multiple lines of evidence from 
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Figure 1. ASF and A3G are distinctively processive. Processivity of A3F and A3G were tested on substrates that contained an internal 
fluorescein (F)-label (yellow star) and two deamination motifs separated by different distances. The ASF substrates had 5'TTC motifs and the A3G 
substrates had 5'CCC motifs. (A) The two target cytosines within the 85 nt ssDNA sequence are spaced 30 nt apart. Single deaminations of the 5'C 
and 3'C are detected as the appearance of labeled 67- and 48- nt fragments, respectively; double deamination of both C residues on the same 
molecule results in a 30 nt labeled fragment (5'C & 3'C). (B) The two target cytosines within the 69 nt ssDNA sequence are spaced 14 nt apart. Single 
deaminations of the 5'C and 3'C are detected as the appearance of labeled 51- and 32- nt fragments, respectively; double deamination of both C 
residues on the same molecule results in a 14 nt labeled fragment (5'C & 3'C). (C) The two target cytosines within the 60 nt ssDNA sequence are 
spaced 5 nt apart. Single deaminations of the 5'C and 3'C are detected as the appearance of labeled 42- and 23- nt fragments, respectively; double 
deamination of both C residues on the same molecule results in a 5 nt labeled fragment (5'C & 3'C). (D) The two target cytosines within the 1 18 nt 
ssDNA sequence are spaced 63 nt apart. Single deaminations of the 5'C and 3'C are detected as the appearance of labeled 100- and 81- nt fragments, 
respectively; double deamination of both C residues on the same molecule results in a 63 nt labeled fragment (5'C & 3'C). (E) Deamination of the 
substrate described for (D), but with a 20 nt ssDNA annealed between the two target cytosines to block the sliding component of processivity. The 
measurements of processivity (Processivity factor) and the Standard Deviation of the mean (S.D.) are shown below the gel. The A3F: DNA ratio was 2:1 
except for panel (A) in which a 1:1 ratio was used. The A3G: DNA ratio was (A-B) 1:10, (C) 1:2.5, (D-E) 1:20. Enzyme: DNA ratios were varied due to 
different specific activities of the enzyme on a given DNA substrate. Values are an average from three independent experiments. 
doi:1 0.1 371/journal.ppat.l 004024.g001 
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Table 1. Comparison of apparent dissociation constants (Kj) 
from ssDNA of A3G and ASF wild-type and mutants. 




Enzyme 


Kd (nM) 


A3G 


130±6 


ASF 


20±1 


ASF CTD 


288±10 


A3G NPM 


56±4 


A3F NGiVl 


119±11 


doi:10.1371/journal.ppat.l004024.t001 



independent labs have shown that the effect of A3F on HIV is 
different than A3G [4,22,23,24], we sought to determine if there 
was an inherent biochemical difference between the two enzymes 
that could account for these observations. Specifically we 
determined if there was a difference in the processive scanning 
mechanisms of these two enzymes with processivity being defined 
as the ability to deaminate more than one cytosine on an ssDNA in 
a single-enzyme substrate encounter. Processivity was determined 
using different synthetic ssDNA substrates containing two deam- 
ination motifs separated by different distances, 5'TTC for A3F 
and 5'CCC for A3G. This strategy was used since with A3G we 
have found that closely spaced deamination motifs, i.e., 5 to 15 nt 
are deaminated most efTiciently through sliding motions and as the 
distance between deamination motifs increases a jumping motion 
facilitates processive deaminations [34]. The substrate usage was 
kept below 15% to ensure single-hit conditions were maintained, 
which means that each ssDNA was only encountered by an 
enzyme at most once during the reaction [46]. 

On a substrate with the target cytosines separated by 30 nt 
(Figure lA, sketch), A3F was able to catalyze processive 
deaminations. The processivity factor is a ratio of the frequency 
of double deaminations on a single substrate to the predicted 
frequency of double deaminations of a nonprocessive enzyme (see 
Materials and Methods). Therefore, the processivity factor of 3.7 
for A3F (Figure lA) means that in a single enzyme-substrate 
encounter A3F was 3.7-fold more likely to catalyze a processive 
deamination than a nonprocessive deamination. On the cognate 
A3G substrate, A3G was 2-fold more likely than A3F to catalyze a 
processive deamination (compare Figure lA, A3F, processivity 
factor of 3.7 and A3G, processivity factor of 7.9), suggesting that 
the processive mechanisms of ASF and A3G differ. In addition, we 
observed a difference in the ability of A3F and A3G to catalyze 5'- 
end biased deaminations. Where A3G has been found to prefer 
deaminations towards the 5 '-end of ssDNA molecules due to a 
catalytic orientation specificity [35], A3F had a minimal 5'-end 
bias (Figure lA, compare intensity of 5'C & 3'C bands for ASF 
and ASG). However, the presence or absence of a 5 '-end bias does 
not influence the processivity calculation [47]. Since ASG has been 
found to use a dual sliding and jumping motion to scan ssDNA 
[34,35,37], we investigated whether the difference between ASF 
and ASG was due to a difference in the contributions of sliding 
and jumping or a different mode of scanning, e.g., intersegmental 
transfer. 

First, we investigated the sliding ability of ASF. We conducted 
deamination assays on ssDNA substrates with closely spaced 
deamination targets, since it has been shown that sliding motions 
increase the frequency of closely spaced deaminations occurring 
processively [34]. With cytosines 14- and 5-nt apart, ASF was 
unable to catalyze any detectable processive deaminations 
(Figure IB-C, ASF, absence of 5'C & S'C band) indicating that 



ASF does not use sliding motions to catalyze processive 
deaminations. Of note, outside of single hit conditions (> 1 5 % 
substrate usage) we detected the band corresponding to deamina- 
tion of both 5'TTC motifs on an ssDNA (5'C & S'C band), which 
demonstrated that multiple molecules of ASF were able to 
deaminate these substrates at both cytosine targets to near 
completion (data not shown). In contrast, ASG was able to 
processively deaminate closely spaced residues under single-hit 
conditions by sliding (Figure IB-C, ASG, processivity factors of 
4.6 and 3.5). 

Since ASF was processive on the substrate with the target 
cytosines separated by SO nt (Figure lA, ASF), but not on 
substrates with closely spaced deamination motifs (Figure IB-C, 
ASF), the data suggested that ASF may use jumping or 
intersegmental transfer to processively deaminate cytosines. To 
investigate this further we determined the processivity of ASF on 
an ssDNA with the target cytosines separated by 63 nt (Figure ID). 
On this substrate, ASF exhibited a processivity factor of 4.6 
(Figure ID, ASF), which is higher than the processivity factor 
obtained on the substrate with the target cytosines separated by 
30 nt (Figure lA, ASF, processivity factor of 3.7). In contrast, ASG 
which can slide and jump [35] maintained a processivity factor of 
~8 (compare Figure lA and D, processivity factors). To confirm 
that we would observe only jumping or intersegmental transfer 
and not sliding motions, we annealed a 20 nt complementary 
DNA in between the target cytosines (Figure IE, sketch). The 
double-stranded DNA portion is not bound as tightly by ASF 
(Figure SIA-B) or ASG (Figure SIC and [35,37,48,49]) as ssDNA 
(Table 1) and results in the assay conditions blocking the sliding 
portion of the scanning activity [35,47] . ASG was still processive 
on this substrate due to the ability to translocate on DNA in 
S-dimensions by jumping, but we observed a ~2-fold decrease 
in A3G processivity as compared to the analogous ssDNA 
substrate (Figure ID-E, compare ASG processivity factors). We 
interpret that the ~2-fold decrease in ASG processivity is due to 
ASG molecules attempting to slide over the dsDNA which induces 
dissociation from the DNA substrate and diffusion into the bulk 
solution. For ASF we observed a 1.8-fold increase in the 
processivity factor when we annealed a 20 nt complementary 
DNA in between the target cytosines (Figure ID-E, compare ASF 
processivity factors), despite having a reduced binding to the 
double-stranded (ds)DNA portion (Figure SIA-B). The double 
deaminations became so efficient that the 5'- and 3 '-proximal 
cytosine deaminations were barely visible on the gel (Figure IE, 
ASF). ASF bound the 118 nt ssDNA substrate (Figure ID) with an 
apparent of 20 nM (Table 1), which is ~7-fold lower than the 
apparent Kj of ASG (Table 1, of 130 nM). This indicates that 
ASF is less likely to dissociate from an ssDNA substrate than ASG, 
but does not fully explain why we observed an increase in 
processivity by annealing a complementary DNA in between the 
target cytosines. Results were not changed by annealing a 20 nt 
complementary RNA molecule to the substrate (Figure S2A-B) or 
by testing ASF on a different partially dsDNA substrate which 
contained two 5'ATC motifs (Figure S3). We speculated that the 
processivity of ASF increased as opposed to remaining the same in 
the presence of the complementary DNA because the structural 
change in the substrate induced by the dsDNA region made 
jumping events more successful. This could occur if the average 
jumping distance of ASF were different than ASG and the rigid 
dsDNA region juxtaposed the 5'TTC motifs at a distance which 
was highly accessible by ASF. 

To test this hypothesis we examined the processive deamina- 
tions of ASF and ASG on an ssDNA substrate with deamination 
motifs separated by 100 nt. We found that as the distance between 
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deamination motifs was increased up to 100 nt, the processivity 
factors of ASF also increased (Figure 2A). In contrast, A3G 
processivity exhibited a plateau when deamination motifs were 
30- to 63-nt apart and the processivity factor decreased when 
deamination motifs were 100 nt apart (Figure 2B). These data 
demonstrate that the average jumping distance of ASF and ASG 
differ. Similar results were also found from analysis of 
deamination-induced mutations in the model HFV replication 
assay and are discussed later in the text. To identify a possible 
reason for the diffi'rc-nt jumping ability of ASF we examined its 
oligomerization state in comparison to ASG. .\SG is kn()\\Ti to 
form polydisperse oligomers that are dependent on enzyme 
concentration and buffer conditions [50]. Using size exclusion 
chromatography at low enzyme concentrations we found that 
ASF formed predominandy tetramers (~180 kDa) and higher 
order oligomers whereas ASG eluted as predominantly a 
monomer (~46 kDa) with minor dimeric species (Figure 2C). 
The finding that ASF forms more tetramers than ASG is 
consistent with previous sucrose gradient data [51] and data on 
the CTD portions of these enzymes. The ASF CTD can 
oligomerize more readily than the ASG CTD [52,53,54,55]. The 
ASF oligomers remained soluble as high speed centrifugation did 
not result in a discernable protein pellet. These data demon- 
strated that ASF oligomers are more stable than ASG oligomers 
at low protein concentration and suggest a structural difference 
that could account for why the ASF jumping distance is different 
than ASG (Figure 2A-B). 

However, since the ASF ssDNA scanning mechanism is more 
efficient in distal translocations (Figure 1), we also investigated 
whether it was scanning ssDNA by intersegmental transfers, 
rather than or in addition to jumping. This mode of DNA 
scanning involves an enzyme molecule that binds in two distal 
locations on the DNA before completing the translocation by 
dissociating from one location [39,40]. The intersegmental 
transfer mechanism requires that the enzyme have more than 
one DNA binding domain. ASF could bind ssDNA with both its 
NTD and CTD on one or many subunits of the oligomer. This 
is in contrast to jumping which uses microdissociations and 
reassociations to scan ssDNA [38,41]. A key difference between 
jumping and intersegmental transfer is that the probability of 
an enzyme transferring to another DNA substrate is low for 
jumping but high for intersegmental transfer [39,40,56]. 
Therefore, to observe whether ASF can scan ssDNA by 
intersegmental transfer we increased the enzyme and substrate 
concentrations, but kept their ratio constant. Crowding the 
reaction in this manner with enzyme and ssDNA can increase 
the tendency of thi; enzyme to translocate to a different ssDNA 
if intersegmental transfer is occurring [56]. This would result in 
a decrease in the observed processivity with increasing reaction 
components. We found that ASF maintained the same 
processivity at a 1:1 E:S ratio at concentrations of 100 nM 
and 200 nil (Figure 2D, processivity factor of 4.7 and 4.8). At a 
1:1 E:S ratio using concentrations of 300 nM and 400 nM the 
processivity of ASF decreased ~ 1.5-fold from 4.7 to 3.0 or 3.3 
(Figure 2D), providing evidence that ASF can use intersegmen- 
tal transfer to scan ssDNA. However, the decrease in ASF 
processivity is small (~1.5 fold), does not decrease gradually 
with increasing enzyme and substrate concentration, and is not 
completely abolished (processivity factor remains above 1) 
suggesting that intersegmental transfer is not the primary 
mechanism of DNA scanning, but can occur in a minority of 
ssDNA-ASF interactions. Importantly, intersegmental transfer 
should result in an increase in the reaction rate with increasing 
DNA concentration since the rate of searching is enhanced by 



increasing the apparent off rate, which allows more rapid 
sampling of DNA [56]. However, the reaction rate of ASF 
decreased with increasing enzyme and substrate concentrations 
(Figure 2D, Rate) and supports the conclusion that interseg- 
mental transfer is not a primary mode of scanning ssDNA. In 
further supjjort of this interpretation is that we only observed 
evidence of intersegmental transfer with increasing enzyme and 
substrate concentration (Figure 2D), not when the ssDNA 
concentration alone was increased (Figure 2E, processivity 
factors of 4.2 to 4.6), which indicates that ASF does not readily 
transfer to anotlu-r ssDNA without high local concentrations of 
enzyme, i.e., the intersegmental transfer is not inherent to ASF 
but requires excessive crowding of reaction conditions. ASG 
showed no decrease in processivity with increasing concentra- 
tion of enzyme and substrate, despite also containing both a 
NTD and CTD (Figure S5). This difference may arise since the 
CTD of ASG binds ssDNA in the micromolar range [54,55,57], 
in contrast to the CTD of ASF that can bind DNA in the 
nanomolar range (Table 1, apparent of 288 nM). All 
together the data supported the conclusion that ASF primarily 
utilized jumping and not intersegmental transfer to scan 
ssDNA. 

The A3F DNA scanning mechanism does not enable 
efficient mutagenesis of (-)DNA 

Our biochemical data on synthetic substrates (Figure 1) predicts 
that ASF will not efficiently catalyze deaminations during proviral 
DNA synthesis due to a predominant jumping movement that 
would result in a superficial scan of the ssDNA [34,43]. 
Importandy, we observed this predominant jumping movement 
when ASF encountered an RNA/DNA hybrid (Figure S2A), such 
as would be encountered during synthesis of the HIV provirus. To 
test this prediction we used our model in vitro HIV replication 
system. Since this system reconstitutes reverse transcription of 
(— )DNA and synthesis of (-h)DNA, it allows us to observe the 
ability of AS enzymes to induce mutagenesis in a dynamic system, 
such as occurs in vivo, but with the advantage of controUing the 
amount of enzyme added to the reaction system. Specifically, this 
system uses an in vitro synthesized RNA which contains (from the 
5'-end) a polypurine tract (PPT), part of the protease gene [jnot) of 
HIV, and a /flc^a reporter. The RNA is reverse transcribed to 
(— )DNA by reverse transcriptase and after RNaseH-mediated 
removal of the RNA, the PPT enables (-l-)DNA synthesis without 
the addition of an exogenous primer. In this manner we can 
achieve the salient properties of HIV replication that A3 enzymes 
must contend with, a finite time to access single-stranded (— )DNA 
and a heterogeneous substrate that is interspersed with RNA 
fragments. 

The ASG data demonstrated the potential amount of mutations 
that could occur in this system. ASG had a clonal mutation 
frequency of 2.6SxlO~^ mutations/bp which is 10-fold over the 
background mutation frequency of re\'ers(' transcriptase (RT) 
(Table 2). Further, the ASG mutation spectra have clear hot-spots 
at 5'CCC or 5'CC motifs in both theprotznA kc^a. with some sites 
being mutated in 100% of clones (Figure SA, e.g., 245 nt). Due to 
the PPT being nearest the prot, this region is converted to dsDNA 
the fastest and incurs less mutations than regions near(;r the cc-ntc-r 
or S'-end of the (+)DNA (Figure SA). As such, we can recover 
white colonies indicating a mutation in the lac^a reporter but upon 
sequencing find no mutations in the prot. Therefore, the number of 
clones with mutations in the prot is a measure of how efficientiy an 
A3 enzyme can induce mutations. The lac^a remains single 
stranded longer and can therefore be visited by multiple A3 
enzymes multiple times. In the prot region, ASG was found to 
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Figure 2. A3F translocations on ssDNA are distinct from A3G, but not due to intersegmental transfer. (A-B) Summary of processivity 
factors for (A) A3F and (B) A3G on ssDNA substrates where the two deamination motifs were separated by 5- to 100- nt. A3F processivity was not 
observable until the distance between cytosines was greater than 14 nt apart (denoted with x), but then increased until 100 nt (filled squares). This 
was distinct from A3G that was processive when cytosines were closely spaced (5- to 14- nt apart), reached a maximum processivity when cytosines 
were 30- to 63- nt apart and then decreased in processivity (filled squares). The grey area represents the region where sliding is required for 
processivity. Jumping has been previously defined to be translocations of >20 nt [72]. Gels for the substrate with deamination motifs separated by 
100 nt are shown in Figure S4. (C) Size exclusion chromatography demonstrates that A3F forms tetramers (—180 kDa) and higher order oligomers. 
This is in contrast to A3G which forms monomers (—46 kDa) and dimers. (D-E) Processivity of A3F was tested on a substrate that contained an 
internal fluorescein (F)-label (yellow star) and two deamination motifs (5'TTC) separated by 63 nt. Single deaminations of the 5'C and 3'C are 
detected as the appearance of labeled 1 00- and 81 - nt fragments, respectively; double deamination of both C residues on the same molecule results 
in a 63 nt labeled fragment (5'C & 3'C). (D) The enzyme; substrate ratio of 1;1 (E;S) was kept constant, but reaction components increased (100;100, 
200:200, 300;300, 400;400 nM) to investigate whether A3F could transfer between two ssDNA substrates. (E) In the presence of an unlabeled ssDNA 
trap (69 nt) the processivity factor of A3F (E:S of 1 ;1 ) remained the same regardless of trap concentration (1 ;0.5, 1 ;1 , or 1 ;5 ratio of labeled ssDNA to 
unlabeled trap ssDNA). The measurements of processivity (Processivity factor), Standard Deviation of the mean (S.D.), and Rate (%/min) are shown 
below the gels. Values are an average from at least two independent experiments. 
doi;1 0.1 371 /journal.ppat.1 004024.g002 



induce no mutations in 13% of clones, but the majority of clones 
had either 1-2 mutations (47%) or 3-4 mutations (37%) 
(Figure 3C). In the lacZ^, A3G-uiduced mutagenesis resulted in 
>7 mutations in the majority of clones (Figure 3D, 60%). 

Addition of A3F to the model HIV replication assay resulted in 
a modest 2.8-fold increase over the background mutation 
frequency (Table 2). Examination of the mutation spectrum 



demonstrated that A3F could induce mutagenesis at a number of 
5'TTC or 5'TC sites along the prot and /ac^a, but that there were 
no clear hot-spots, except possibly at position 305 nt of lac^a. 
(Figure 3B). This may be due to the random binding of A3F to the 
(— )DNA and an inefficient search of the enzyme by jumping 
without local scanning by sliding (Fignre 1), which would make 
interaction with multiple 5'TTC or 5'TC motifs less likely to 
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Table 2. A3-mediated mutation frequencies in a model HIV replication system. 





Enzyme 


Population mutation 
frequency 


Base pairs sequenced 


Total number of G^A 
mutations 


Clone mutation frequency (x10 ^ 
mutations/bp) 


A3G 


0.89 


11 040 


290 


2.63 


A3F 


0.61" 


10 304 


78 


0.76" 


A3G NPM 


0.14' 


9200 


27 


0.29" 


ASF NGM 


0.80 


10 672 


36 


0.34" 



The ratio of white colonies to total colonies is defined as the population mutation frequency. The average number of G— >A mutations per base pair in the 368 nt prof- 
/acZs construct is defined as the clone mutation frequency. For RT alone (no A3), all base changes were used to calculate the clone random mutation frequency which 
established a baseline of 0.27x10"^ mutations/bp. The RT alone condition produced a population mutation frequency of 0.07. 
"Significant difference was p<0.001 versus A3G values. 
doi:10.1371/journal.ppat.l004024.t002 



occur. Of note, the mutation frequencies induced by A3F and 
A3G did not increase with the addition of more enzyme to the 
reaction demonstrating that both ASF and A3G are present at 
saturating levels (data not shown). Analysis of the distances 
between A3F-induced mutations demonstrated that 75% of the 
mutations were separated by more than 20 nt (Table 3), 
confirming that A3F was using jumping this assay system, in 
agreement with the data on the synthetic oKgonucleotide 
substrates (Figure 1). In contrast, only 50% of A3G-induced 
mutations were separated by more than 20 nt (Table 3), providing 
confirmation that A3G is capable of recognizing sites that are 
more closely spaced (Figure 2A-B). The analysis in Table 3 
included all clones (highly mutated and sparsely mutated). To 
ensure we did not bias our analysis we also examined only sparsely 
mutated clones for both A3G and A3F (2-5 mutations) and 
obtained similar results for frequency of mutations separated by 
more than 20 nt (A3G, 60% and A3F, 85%). In addition, we 
hypothesized that the tight binding of A3F to ssDNA (Table 1), 
would prevent A3F from frequently dissociating into the bulk 
solution and reassociating with different (— )DNAs. In agreement 
with the binding data, A3F increased the population mutation 
frequency (frequency of white colonies) only 9-fold over the 
background whereas A3G caused a 12-fold increase in the 
population mutation frequency (Table 2). Although the overall 
level of mutagenesis induced by A3F was low, we did observe 
slightly more mutations in the lac^a than the prot region due to the 
replication kinetics (Figure 3E-F). In the majority of A3F clones 
(64%) there were no mutations in the prot region (Figure 3E). In the 
lac^oi region the majority of clones only had 1-2 mutations 
(Figure 3F, 36%). However, 29% of clones did not have a G^A 
mutation and were recovered due to an RT induced error 
(Figure 3F, 0-0). These data demonstrated that A3F was inefficient 
at inducing mutagenesis during reverse transcription even in areas 
where the enzyme had ample time to access ssDNA (lac^oc) and 
especially in regions that are single-stranded the shortest time (prot) 
(Figure 3E-F). 

Inactivation of HIV prot 

The increased distance between A3F-induced mutation sites in 
the HIV replication assay (Table 3) in combination with the data 
on synthetic oligonucleotide DNA indicating that A3F prefers to 
use jumping (Figure 1) provides evidence that the decreased 
mutagenic ability observed for ASF in cell culture may be due to 
an inefficient search mechanism on DNA. However, these 
observations are inconsequential if each mutation by ASF were 
to inactivate the prot gene, which is used here as a predictor of HIV 
inactivation potential. We gauged the probability that the prot of 
HIV would be inactivated by ASF by determining the mutated 



amino acid sequences and comparing this to an extensive 
mutagenesis study of the prot conducted by Loeb et al. [58], 
Consistent with ASF inducing a low number of mutations (Figure 3 
and Table 2), there were no ASF- induced mutations in 64% of 
clones (Figure 4B). On a per clone basis, ASF-induced mutations 
resulted in protease inactivation only 50% of the time (Figure 4B, 
18% active and 18% inactive). The high number of clones 
remaining active was due to two reasons. First, some clones 
incurred mutations in regions where any amino acid is tolerated 
[58], even nonconservative changes, e.g., E21K, so the mutation 
was insignificant (Table S2). Second, some clones incurred a 
mutation that resulted in a conservative change to the amino acid 
which enabled the protease to retain fuU or partial activity, 
depending on the proximity to the active site residues [58] (Table 
S2). For example, the M46I mutation was induced by ASF in 1 1 % 
of clones, but results an active and drug resistant protease. The 
remainder of the ASF-induced mutations were found only in single 
clones and 36% of those mutations resulted in an active protease 
(Table S2). AH together, ASF was not efficient at inactivating the 
HIV protease and could also induce resistance to protease 
inhibitors (Table S2, DSON and M46I). This was in contrast to 
ASG which caused inactivation of 84% of the clones and left only 
3% of clones active (Figure 4A). ASG also did not mutate some prot 
clones, but only 13% of the time (Figure 4A). ASG did induce 
protease drug resistant mutations in 10% of the population (Table 
S2, DSON), but the examination of these clones individually 
demonstrated that they were inactivated by other mutations. 
Overall, we found that per mutation ASG was more likely to cause 
an inactivating mutation than A3F. This appeared to be due to the 
ability of a 5'CCC motif to cause more nonconservative mutations 
than 5'TTC in the prot (Table S2). For example, ASG had clear 
hot spots that caused inactivation of the protease, e.g., W42 
STOP, 20% of clones; G51R, 36% of clones; G52S, 52% of clones 
(Table S2). 

Determinants of processivity for A3 enzymes 

To investigate the ASF DNA scanning mechanism further we 
made mutants in ASG and ASF to alter their processive scanning 
behavior. For ASG, the only other AS double Z-domain enzyme 
studied with regards to processivity, the NTD domain acts as a 
processivity factor [34]. The ASG CTD domain alone is non- 
processive (Figure 5A-B and [53,54]). In order to focus in on the 
determinants of processivity, we recombinantly expressed the CTD 
domain of ASF and tested its processivity using ssDNA substrates as 
in Figure 1 . We found that the CTD of ASF could not processively 
deaminate cytosines that were spaced 63- or 30- nt apart, similar to 
the CTD of ASG (Figiare 5A-B, absence of 5'C & 3'C band). 
The ASF CTD could also not processively deaminate target 
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Figure 3. A3F exhibits iow mutagenic potential in a modei IHIV repiication system. (A-B) Spectra of mutations are plotted as the 
percentage of clones containing a mutation at a particular location (nt) in the 368 nt prot-lacZa construct for (A) A3G or (B) A3F. (C-F) Histograms 
illustrate the disparity between the number of mutations that can be induced by A3G versus A3F in the (C, E) prot region that is single stranded for a 
shorter time than the (D, F) lacZa region. 
doi:1 0.1 371/journal.ppat.1 004024.g003 



cytosines 14- or 5-nt apart (Figure 5C-D, absence of 5'C & 3'C To determine the specific amino acids within the NTD that 

band), similar to the fioU-length ASF enzyme (Figure IB-C). These differentiate the processive scanning behaviors of ASF and ASG 
data indicated that the NTD of ASF was a processivity factor. we aligned their amino acid sequences and looked for differences 
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Table 3. Analysis of distances between G^A mutations for 
A3G and ASF. 





Enzyme 


Frequency of clones with mutations >20 nt apart 


A3G 


0.51 


A3F 


0.75° 



^Significant difference was p:^0.001 versus A3G values. 
doi:l 0.1 371 /journal.ppat.l 004024.1003 



in the predicted lielix 6 and loop 7 (Figure S6) since tliese regions 
have been shown to influence the scanning behavior of A3G [34]. 
Specifically, it was found that helix 6 mediated sliding movements 
and loop 7 mediated jumping movements [34]. Since we could not 
observe any scanning by sliding for A3F (Figure IB-C), we 
hypothesized that residues within or near predicted N-terminal 
heUx 6 would be different from A3G. For A3G, His 186 was found 
to be essential for sliding movements [34]. Although ASF has a 
Hisl81 equivalent to A3G (His 186), at the end of the predicted 
heUx 6 in the connection domain between the NTD and CTD, 
ASF has an additional three amino acids, '^"NPM'^^ m 
comparison to ASG (Figure 6A). To test whether the '^°NPM'^^ 
motif prevents ASF from sliding, we inserted the NPM motif into 
the equivalent position in ASG ('^'^NPM'^') creating an ASG 
NPM mutant. We then tested if ASG NPM was still able to 
undergo scanning by sliding. Using the ssDNA substrates with 
target cytosines close together enables the observation of 
processive deaminations by sliding [34]. On the substrate with 
cytosines separated by 5 nt, ASG NPM retained its processivity at 
an equivalent frequency to that of the wUd-type ASG (compare 
Figure 6B and Figure IC, processivity factors). On the substrate 
with cytosines separated by 14 nt, ASG NPM was essentially not 
processive, as evidenced by a processivity factor of 1 which means 
that ASG NPM double deaminations occurred at the same 
frequency as expected if they were uncorrelated (Figure 6C). This 
was in contrast to wild-type ASG that was able to processively 
deaminate cytosines located 14 nt apart (Figure IB, processivity 
factor 4.6). These data indicated that the NPM insertion had 
decreased the sliding distance of ASG. To ensure that jumping 
was not affected, we tested the ASG NPM mutant on a substrate 
with cytosines separated by 63 nt without or with a complemen- 
tary DNA or RNA annealed. First, we established the processivity 
on this substrate when fully single-stranded. Accordingly, the 
ASG NPM which had a decreased ability to slide, had a 
decreased processivity factor on this substrate in comparison to 
wild-type ASG (compare Figure 6D, processivity factor of 5. 1 to 
Figure ID, processivity factor of 8.1), but similar to ASF 
(Figure ID, processivity factor of 4.6). When a complementary 
DNA was annealed the processivity of the ASG NPM was not 
decreased (Figure 6E, processivity factor of 4.4) demonstrating 
that the jumping motion of ASG NPM was not affected. Similar 
results were found when a complementary RNA was annealed to 
the substrate (Figure S2C). This was in contrast to the 
characteristic 2-fold decrease in processivity observed with ASG 
when a complementary DNA or RNA is annealed in between 
two target cytosines (Figure ID-E and Figure S2B) consistent 
with the hypothesis that attempts to slide over the dsDNA region 
by wild-type ASG results in dissociation of the enzyme into the 
bulk solution. That we did not fmd an increase in the jumping 
efficiency for the ASG NPM (Figxire 6E), in contrast to ASF 
(Figure IE) is in agreement with published data that suggest the 




■ Inactive 
Active 

■ No Mutation 



Figure 4. Predicted ability of ASF and A3G to inactivate HIV 
protease. Each prof clone was individually analyzed to determine the 
percentage of clones that resulted in a mutated and inactive (red) prof, 
mutated and active prof (green) or prof with no mutations (black) after 
exposure to (A) ASG or (B) ASF. (A) ASG was able to inactivate the prof 
in 84% of clones and left an active prof in S% of mutated clones. ASG 
did not induce any mutations in the prof in 1S% of clones. (B) ASF- 
induced mutagenesis was less effective than ASG due to no mutations 
being induced in 64% of clones. Of the S6% of clones with a mutation, 
18% left the prof active and 18% inactivated the prof. 
doi:10.1S71/journal.ppat.1004024.g004 



determinants of jumping are separate from sliding and localized 
to predicted loop 7 (Figure S6 and [S4]). Further, the 
oligomerization state of ASG NPM is equivalent to wild-type 
ASG (data not shown), not ASF and this may influence the 
jumping distance of an AS enzyme (Figure 2A-C). 

To ensure that the effects on processivity were due to specific 
changes to the residues interacting with ssDNA while scanning, 
rather than solely due to a poor affinity for ssDNA, we 
examined ASG NPM by circular dichroism (CD) spectroscopy 
and the ability of ASG NPM to bind ssDNA using rotational 
anisotropy. The CD analysis confirmed that ASG NPM and 
ASG were structurally similar (data not shown). Interestingly, 
addition of the NPM residues to ASG resulted in a 2-fold 
increase in the binding affinity of ASG for the ssDNA, 
implicating these residues in the ssDNA-NTD interaction 
(Table 1, ASG, of ISO nM; ASG NPM, of 56 nM). 
The specific activity of ASG NPM was decreased ~S-fold in 
comparison to ASG (Table 4, ASG, 15 pmol/|Xg/min; ASG 
NPM, 5.5 pmol/ng/min). 

To further investigate the influence of the NPM motif in ASF, 
we attempted a reciprocal mutation, i.e., deleting the NPM motif 
from ASF. However, the mutant ASF did not express well in the 
6/9 expression system indicating that the NPM deletion caused a 
structural instability. To circumvent this we made a conservative 
mutation in ASF to change the NPM motif to an NGM motif 
We hypothesized that the Pro would have a significant influence 
on the functionality of the motif since Pro gives structural 
rigidity. We then tested the ability of the ASF NGM to 
processively deaminate two closely spaced deamination motifs 
by sliding. We found that ASF NGM was able to processively 
deaminate cytosines that were 5 nt and 14 nt apart (Figure 7A— 
B, processivity factors of 2.1 and 2.4), in contrast to ASF 
(Figure IB-C). When the distance between the cytosines was 
increased to SO nt or 60 nt apart, ASF NGM was able to 
undergo processive deaminations similarly to ASF (compare 
Figure 7C-D to Figure 1 A and D). Interestingly, the apparent 
of ASF NGM was 1 19 nM, which is 6-fold larger than the of 
ASF (Table 1, 20 nM) further implicating these residues in the 
enzyme-ssDNA interaction. The specific activity of ASF NGM 
was ~ 1.5-fold higher than ASF (Table 4). The ASF NGM and 
ASG NPM results demonstrated that the presence of an NPM 
motif blocks the ability of both ASF and ASG to processively slide 
on ssDNA. 
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Figure 5. Similar to the A3G CTD, tlie A3F CTD does not deaminate cytosines processively. Processivity of A3F CTD and A3G CTD were 
tested on substrates that contained an internal fluorescein (F)-label (yellow star) and two deamination motifs separated by different distances. The 
substrates had 5'TTC motifs (ASF CTD) or 5'CCC motifs (A3G CTD). (A) The two target cytosines within the 118 nt ssDNA sequence are spaced 63 nt 
apart. Single deaminations of the 5'C and 3'C are detected as the appearance of labeled 100- and 81- nt fragments, respectively. Double deamination 
of both C residues on the same molecule, which would result in a 63 nt labeled fragment (5'C & 3'C), could not be detected indicating that A3F CTD 
and A3G CTD are not processive on these ssDNA substrates. (B) The two target cytosines within the 85 nt ssDNA sequence are spaced 30 nt apart. 
Single deaminations of the 5'C and 3'C are detected as the appearance of labeled 67- and 48- nt fragments, respectively. Double deamination of 
both C residues on the same molecule, which would result in a 30 nt labeled fragment (5'C & 3'C), could not be detected indicating that A3F CTD 
and A3G CTD are not processive on these ssDNA substrates. (C) The two target cytosines within the 69 nt ssDNA sequence are spaced 14 nt apart. 
Single deaminations of the 5'C and 3'C are detected as the appearance of labeled 51- and 32- nt fragments, respectively. Double deamination of 
both C residues on the same molecule, which would result in a 14 nt labeled fragment (5'C & 3'C), could not be detected indicating that A3F CTD is 
not processive on this ssDNA substrate. (D) The two target cytosines within the 60 nt ssDNA sequence are spaced 5 nt apart. Single deaminations of 
the 5'C and 3'C are detected as the appearance of labeled 42- and 23- nt fragments, respectively. Double deamination of both C residues on the 
same molecule, which would result in a 5 nt labeled fragment (5'C & 3'C), could not be detected indicating that A3F CTD is not processive on this 
ssDNA substrate. The A3F CTD: DNA and A3G CTD: DNA ratios were 2:1. A representative gel from three independent experiments is shown. 
doi:1 0.1 371 /journal.ppat.1 004024.g005 



Contributions of processivity to efficient nnutagenesis of 
(-)DNA 

Our model predicts that the A3G NPM mutant should be a 
poor inducer of mutagenesis during (— )DNA synthesis due the 
decreased ability of this mutant to slide on ssDNA (Figure 6). In 
agreement with the model, the A3G NPM induced mutagenesis 
poorly in the model HIV replication system (Figure 8A), similar 
to ASF (Figure 3B), but in contrast to wild-type A3G 
(Figure 3A). The A3G NPM mutant had a mutation frequency 
in the HIV replication assay (Table 2, 0.29x10 ^ mutations/ 
bp), which was 9-fold less than wild-type A3G (Table 2, 
2.63 X 10~^ mutations/bp). The spectrum and sequence analysis 
demonstrated that the sparse mutations induced by A3G NPM 
were still in 5'GG or 5'GGG contexts, but that much fewer 
occurred (Figure 8A and Table S2). The A3G NPM mutant 
rarely induced mutations in the prot (Figure 80) and mutations 
in the lac^a region were less than A3F (compare Figure 8D and 
Figure 3F). Notably, the A3G forms had a 100- (A3G) to 40- 
(A3G NPM) fold greater specific activity than A3F (Table 4). 
However, since A3G NPM and ASF similarly induced less 
mutations (Figure 8A and Figure SB) than ASG (Figure SA), the 



data indicated that the ssDNA searching mechanism, but not 
the specific activity was a primary determining factor in levels of 
AS-induced mutagenesis. 

Our model, which is based on mutagenesis data from ASG, 
predicted that the mutation frequency of ASF NGM should 
increase in comparison to ASF. However, despite the ASF NGM 
mutant being able to sKde (Figure 7), we found that ASF NGM 
remained inefficient at inducing mutagenesis in the in vitro HIV 
replication assay (Figure 8B). The induced mutagenesis of ASF 
NGM (Table 2, O.S4xlO~^ mutations/bp) was more similar to 
ASF than ASG (Table 2). This could be due to ASF NGM sliding 
being ~2-fold less efficient than ASG (compare Figure 7A-B and 
Figure IB-C) or that the recovery of sliding alone in ASF is not 
sufficient for increasing the levels of mutagenesis. The latter 
possibility suggested another determining factor specific to ASF 
may affect its mutagenic ability. Namely, ASF NGM retained two 
distinct properties of ASF, the formation of tetramers and higher 
order oligomers (data not shown) and an increase in processivity 
with increasing distance between deamination motifs (Figure 7 
and data not shown). Therefore, we propose that the jumping 
mechanism of ASF that is retained in ASF NGM and distinct 
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Figure 6. Decreased processivity of A3G NPM was attributable to clianges in sliding, but not jumping movements. (A) The A3G NPiVI 
mutant was created by inserting tine NPIVI motif found in A3F into A3G immediately after Arg 194. (B-E) Processivity of A3G NPM was tested on 
substrates that contained an internal fluorescein (F)-label (yellow star) and two deamination motifs separated by different distances. The substrates 
had 5'CCC motifs. (A) The two target cytosines within the 60 nt ssDNA sequence are spaced 5 nt apart. Single deaminations of the 5'C and 3'C are 
detected as the appearance of labeled 42- and 23- nt fragments, respectively; double deamination of both C residues on the same molecule results in 
a 5 nt labeled fragment (5'C & 3'C). (C) The two target cytosines within the 69 nt ssDNA sequence are spaced 14 nt apart. Single deaminations of the 
5'C and 3'C are detected as the appearance of labeled 51- and 32- nt fragments, respectively. Double deamination of both C residues on the same 
molecule results in a 14 nt labeled fragment (5'C & 3'C) and were detected at a low level resulting in a processivity factor of 1 (below gel). Since the 
processivity factor is a ratio between the observed double deaminations and the theoretical deaminations expected to occur for a nonprocessive 
enzyme (see IVlaterials and Methods), the results indicated that the A3G NPM mutant was not processive on this substrate. (D) The two target 
cytosines within the 118 nt ssDNA sequence are spaced 63 nt apart. Single deaminations of the 5'C and 3'C are detected as the appearance of 
labeled 100- and 81- nt fragments, respectively; double deamination of both C residues on the same molecule results in a 63 nt labeled fragment (5'C 
& 3'C). (E) Deamination of the substrate described for (D), but with a 20 nt ssDNA annealed between the two target cytosines to block the sliding 
component of processivity. The measurements of processivity (Processivity factor) and the Standard Deviation of the mean (S.D.) are shown below 
the gel. The A3G NPM: DNA ratio was (B) 1 :2.5, (C) 1:10, (D-E) 1 :20. Enzyme: DNA ratios were varied due to different specific activities of the enzyme 
on a given DNA substrate. Values are an average from three independent experiments. 
doi:10.1371/journal.ppat.1004024.g006 



from that of A3G is detrimental to efiicient mutagenesis 
and remains as such even the presence of sliding movements. 
With this being considered, the contributing factors to the 
efficiency of A3-induced mutagenesis is not only the balance 
between sliding and jumping, as exemplified by A3G, but also 
the type of jumping movements, as exemplified by ASF. The 
ASF NGM mutant also retained the 5'TTC specificity charac- 
teristic of ASF and induced similar mutations in HIV prot 
(Table S2). 



A3 processive scanning mechanism determines ability to 
restrict HIV in single-cycle replication assays 

The biochemical data support the hypothesis that the processive 
scanning mechanism of the AS enzyme can determine its 
mutagenic potential during reverse transcription. However, the 
in vitro HIV replication assay used in our experiments cannot 
account for how the HIV capsid environment may influence A3 
enzyme-induced mutagenesis. Therefore, we used a single-cycle 
replication assay to test whether mutagenesis induced in the prot of 
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Table 4. Specific activities of A3G and ASF wild-type and 
mutants. 





Enzyme 


Specific Activity (pmoi/^g/min) 


A3G 


15±1 


A3F 


0.14±0.03 


A3G NPM 


5.5±1 


ASF NGM 


0.20±0.02 



The specific activity was determined using the substrate with target cytosines 
separated by 63 nt (118 nt substrate) and the values are shown with the 
standard deviation that was calculated from three Independent experiments. 
doi:10.1371/journal.ppat.1004024.t004 



HWABif proviral DNA by the deamination activity of A3G, A3F 
and tlieir mutant derivatives would recapitulate the results of A3- 
induced prot mutagenesis in the model HIV replication assay. In 
agreement with the biochemical data, in the HIV Avif proviral 
DNA the A3G-induced mutations/kb were 6- to 8-fold higher 
than those of A3F, A3G NPM or ASF NGM (Table 5). Upon 
analysis of codon changing mutations, we found that the A3G 
hotspot in the prot was the Trp 42 codon, which was mutated to a 
stop codon in all clones containing a mutation, except one clone 
(Table S3). Clones mutated by A3G-catalyzed deaminations also 
contained other inactivating mutations such as G51R/E or G86R 
(Table S3). In regards to hotspots, the data were similar for A3G 
NPM, although fewer mutations were recovered (Table 5 and 
Table S3). These data supported our biochemical data in which 
the decrease in sliding by A3G NPM in comparison to A3G 
resulted in a decrease of mutagenic potential (compare Figure 3A 
and Figure 8A). The prot clones exposed to A3F or A3F NGM had 
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Figure 7. A3F NGM is able to slide on ssDNA to catalyze processive deaminations. Processivity of A3F NGM was tested on substrates that 
contained an internal fluorescein (F)-label (yellow star) and two deamination motifs separated by different distances. The substrates had 5'TTC motifs. 
(A) The two target cytosines within the 60 nt ssDNA sequence are spaced 5 nt apart. Single deaminations of the 5'C and 3'C are detected as the 
appearance of labeled 42- and 23- nt fragments, respectively; double deamination of both C residues on the same molecule results in a 5 nt labeled 
fragment (5'C & 3'C). (B) The two target cytosines within the 69 nt ssDNA sequence are spaced 14 nt apart. Single deaminations of the 5'C and 3'C 
are detected as the appearance of labeled 51- and 32- nt fragments, respectively. Double deamination of both C residues on the same molecule 
results in a 14 nt labeled fragment (5'C & 3'C). (C) The two target cytosines within the 85 nt ssDNA sequence are spaced 30 nt apart. Single 
deaminations of the 5'C and 3'C are detected as the appearance of labeled 67- and 48- nt fragments, respectively; double deamination of both C 
residues on the same molecule results in a 30 nt labeled fragment (5'C & 3'C). (D) The two target cytosines within the 1 18 nt ssDNA sequence are 
spaced 63 nt apart. Single deaminations of the 5'C and 3'C are detected as the appearance of labeled 100- and 81- nt fragments, respectively; double 
deamination of both C residues on the same molecule results in a 63 nt labeled fragment (5'C & 3'C). The measurements of processivity (Processivity 
factor) and the Standard Deviation of the mean (S.D.) are shown below the gel. The A3F NGIVI: DNA ratio was 1:1. Values are an average from three 
independent experiments. 
doi:1 0.1 371 /journal. ppat.1 004024.g007 
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Figure 8. Cytosine deamination-induced mutagenesis by mutant A3G and ASF in a modei IHIV replication system. (A-B) Spectra of 
mutations are plotted as the percentage of clones containing a mutation at a particular location (nt) in the 368 nt prot-lacZn construct for (A) A3G 
NPM or (B) A3F NGM. (C-F) Histograms show the number of mutations per prof or lacZa region for (C-D) A3G NPM or (E-F) A3F NGM. 
doi:l 0.1 371/journal.ppat.1 004024.g008 



mutations that at best resulted in partial inhibition of protease 
activity, e.g., D30N or M46I, and none that resulted in complete 
inactivation of protease activity (Table S3). It is interesting that 
ASF NGM induced ~ 1.3-fold more mutations/kb than A3F, 
suggesting that there was a slight positive effect of the ASF NGM 
sliding ability on mutagenesis (Table 5). Overall, the prot 
sequencing data from HIV Avif proviral clones was consistent 



with the conclusions from the in vitro model HIV replication assay 
and many deamination hotspots were common between the two 
assays (compare Tables S2 and SS). Differences may have resulted 
from dififerent temporal dynamics of reverse transcription [59] and 
that the in vitro assay used a smaller segment of the prot gene. The 
observation from in vitro data that the 5'TTC motif was less able to 
cause inactivating mutations than the 5'CCC motif was consistent 



PLOS Pathogens | www.plospathogens.org 



13 



March 2014 | Volume 10 | Issue 3 | el 004024 



AP0BEC-3G and -3F Are Distinctively Processive 



Table 5. Analysis of A3-induced mutagenesis of prot DNA from integrated HWAvif. 





Enzyme 


Base pairs sequenced 


Total number of G^A mutations 


G— >A mutation frequency (mutations/kb) 


A3G 


6318 


37 


5.9 


A3F 


7371 


5 


0.7 


A3G NPM 


7020 


7 


1.0 


A3F NGM 


9126 


8 


0.9 



doi:10.1371/journal.ppat.1004024.t005 



with HIV /f»!f proviral DNA exposed to ASF or A3G (Table S3). 
Not only was the 5'CCC motif able to cause more inactivating 
mutations by overlapping with the Trp codon (5'TGG), which 
results in a stop codon, as previously observed [18], but also 
because it was more likely to cause nonconservative mutations in 
comparison to the 5'TTC motif (Table S3 and Ref [44]). 

The impact of A3G- and A3F-induced mutations on the 
infectivity of the proviral DNA was also examined using the cGFP 
reporter gene contained in the HIV pNL4-3 Avjf construct. 
Consistent with sequencing data from the prot region, the cGFP 
reporter gene of the integrated provirus from the same assays was 
inactivated 3- to 4-foId more in HFV /Ivjf virions exposed to A3G 
in comparison to ASF, A3G NPM or ASF NGM (Figure 9A). To 
ensure this was not due to differences in encapsidation efficiency 



between these A3 enzymes we conducted quantitative immuno- 
blotting on virions and cell lysates. Since we had transfected 
untagged A3 enzymes for these experiments to avoid the potential 
effects a tag may have on processivity (Figure S7), we initially 
standardized the antibodies for native ASG and ASF. Using 
equivalent amounts of purified protein and antibody dilutions, we 
determined that the antibody to ASF was 9-fold less sensitive than 
the antibody to ASG (Figure 9B). As a result, we used this as a 
correction factor in the calculated amounts of these enzymes in 
virions and cells (Figure 9C). The immunoblot results demon- 
strated that ASG and ASF were expressed in 293T cells and 
encapsidated into Avjf virions to a similar level (Figure 9B-C). 
Therefore, the data support that there is a bone fide difference in the 
inherent mutagenic abilities of ASG and A3F. We also confirmed 
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Figure 9. A3 enzyme processivity influences the HIVz/i/'// restriction efficiency. (A) Virus infectivity was measured by eGFP expression in 
293T cells infected with HIVJWf that was produced in the absence or presence of A3G, A3F, A3G NPM or A3F NGM. Results normalized to the no A3 
condition are shown with the Standard Deviation of the mean calculated from at least three independent experiments. (B-C) Quantitative 
immunoblotting was used to determine the levels of A3G, A3F, A3G NPM, or A3F NGM expressed in cells and encapsidated into HIVzlv/T virions. (B) 
The detection capabilities of antibodies to A3G (Apo CI 7, NIH AIDS Reagent Program) or A3F (C-18, NIH AIDS Reagent Program) was determined by 
detecting 10 ng of purified A3G or A3F with a 1/1000 dilution of the appropriate antibody. At this dilution, the antibody to A3F was found to be 
9-fold less sensitive than the antibody to A3G. This data was used as a correction factor during quantitation of blots (see Materials and Methods). The 
loading control for cell lysates was a-tubulin and for virions was p24. (C) Blots from at least three independent experiments were analyzed using 
Odyssey software to determine the band intensity in the cell lysate (black bars) or virions (gray bars). The values were then corrected to account for 
the antibody sensitivity (see Materials and Methods). Loading controls were confirmed during quantification to not be significantly different (data not 
shown). The error bars represent the Standard Deviation of the mean. A t-test determined that there were no significant differences in enzyme 
expression or encapsidation. 
doi:10.1371/journal.ppat.1004024.g009 
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that A3G and its NPM mutant and ASF and its NGM mutant 
were expressed in cells and encapsidated in virions similarly 
(Figure 9B-C) enabling comparisons to be made between the 
mutant and wild-type forms of the enzymes. The analysis of A3G 
or ASF mutants from single-cycle infectivity assays was consistent 
with biochemical data. The A3G NPM mutant that had 
diminished shding ability was less able to restrict HIV replication 
than A3G (Figure 9A, 3-fold). ASF NGM was able to decrease 
HYVAvjf infectivity 10% more than ASF, suggesting a slight 
positive effect of its sliding abiUty, but this was not statistically 
significant (Figure 9A). These data provided evidence that the 
processive scanning mechanism of the AS enzyme influences the 
capacity to restrict HIV in a single cycle of replication. The 
disparity in HIV restriction efficiency was confirmed to be due to 
differences in mutational load by sequencing the TlTVJvif 
integrated provirus eGFP reporter gene (Figure S8). 

Discussion 

Reports have demonstrated that ASF is less effective than ASG 
at restricting HIV replication and leaves less of a mutational 
footprint [22,23,24,29] . This could be due to many reasons such as 
differences in mRNA/protein expression levels [22,31], virion 
encapsidation levels [4,33], deamination site preference [4,18], or 
the inherent biochemical characteristics of the enzymes that 
govern deamination activity during proviral DNA synthesis. There 
is no consensus in the literature regarding whether any of the 
variables determined by cellular conditions, e.g., mRNA expres- 
sion levels, create disparity between ASF and ASG HIV restriction 
activities. In addition, other reports have found an equal capacity 
of ASF and ASG to restrict HIV [3,5,6,7,20,21,30]. To account 
for these differences in the literature we undertook a biochemical 
characterization of ASF in comparison to ASG. The data have 
enabled us to form a biochemical model to account for cell-based 
observations and propose that the processive DNA scanning 
mechanism and the preferred deamination motif of A3 deoxycy- 
tidine deaminases are determinants of HIV restriction efficiency. 

The data support the hypothesis that a balanced sliding and 
jumping scanning mechanism is a major contributor to efficient 
restriction of HIV [34] and ASF has less potential to restrict HIV 
because it does not slide and uses a jumping translocation 
mechanism that is different than ASG (Figure 1 and Figure 2). 
Analysis of ASG and ASF mutants further support the model in 
which the mechanism that the enzymes scan DNA and not their 
specific activity can fully account for differences obser\'ed in HIV 
restriction (Table 4, Figure 3, Figure 8, and Figure 9). In addition, 
ASF-induced mutations in preferred 5'TTC/5'TC motifs were 
less efficient at inducing gene inactivation than the preferred ASG 
5'CCC/5'CC motifs, similar to what was identified for ASA 
(prefers 5'TTC/5'TC) [44], adding another distinction in the 
mutagenic ability of ASF (Tables S2 and S3). 

Howe\er, the data cannot support that ASF has no effect on 
HIV since it is suppressed by Vif [6], but there is evidence that the 
restriction abiUties are distinct from ASG in regards to mutagenic 
load, selection pressure on HIV and contribution of deamination- 
independent HIV restriction [24,29,60,61,62,63]. It was initially 
recognized by Zennou and Bieniasz that per mutation, ASG could 
cause a much larger decrease in HIV infectivity than ASF [4] . This 
early study on ASF [4] was in contrast to other early studies 
published showing ASF was similar in effectiveness to ASG 
[3,5,6,7,30]. Such incongruent data still remains in the literature 
[21,22,23,24,29] and may be due to different experimental 
systems. Specific to our data, we observed that the HYVAvif 
retained 51% infectivity in the presence of ASF and 13% 



infectivity in the presence of ASG, suggesting that ASF is not as 
effective as ASG at restricting HIV (Figure 9A). However, Albin 
et al. found that over multiple replication cycles, ASF restricted 
HIV replication similarly to ASG and selected for Vif mutant 
revertants [61]. It may be that ASG requires only one exposure to 
HIV for high level restriction compared to ASF that may require 
multiple cyties for strong HIV restriction, but the end point is the 
same. Importandy, multiple infection rounds more closely mimics 
how A3 enzymes would interact with HIV in vivo. Nonetheless, our 
data propose that the mechanism by which ASG and ASF reach 
this end is different and that ASF has the potential to cause more 
sequence diversification of HIV than ASG. This idea is supported 
by Chaipan et al. that found that ASF suppressed HIV in multiple 
rounds of replication but required a longer period of exposure to 
HIV before the level of suppression reached that of ASG [24]. 
This is consistent with our sequence data from the prot of 
integrated proviruses (Table SS). As such, the role of ASF may be 
to supplement mutagenesis induced by ASG [21,29] since their 
effects have been shown to be additive [5] or be distinct from ASG 
and perhaps rely on a deamination-independent mechanism, such 
as inhibition of reverse transcription and integration [60,63,64]. 
ASF has been reported to exert a larger deamination-independent 
inhibition of HIV repfication than ASG, but this is not as effective 
as deamination-mediated restriction of HIV [60,63] . 

To characterize the mechanism by which A3 enzymes induce 
mutagenesis we studied the ASG NPM mutant. The ASG NPM 
mutant demonstrated that the scanning mechanism on DNA and 
not specific activity is a primary determinant in mutation 
induction during reverse transcription. Since ASF had a lower 
specific activity than ASG (Table 4), it could be argued that this 
was contributing to the lower level of induced mutagenesis 
(Figure 3). However, the ASG NPM mutant, which had decreased 
sliding in comparison to wild-type ASG (compare Figure 1 and 
Figure 6), retained a specific activity more similar to that of ASG 
than ASF (Table 4), but induced a very low level of mutagenesis 
(Figure 8A, C-D) and decreased HIV infectivity only 2-fold versus 
ASG that decreased HIV infectivity 8-fold (Figure 9A). These data 
suggest that specific activity is not a determinant in the ability to 
cause mutations during reverse transcription and is supported by 
previous data in which the specific activity of the enzyme was 
inconsequential during reverse transcription [43] . This appears to 
be because the activity of the enzyme during reverse transcription 
is instead determined by factors such as (— )DNA synthesis and 
RNaseH activity [43]. The ASG NPM mutant further confirmed 
that the determinants within the NTD for sfiding involve residues 
near ASG predicted helix 6 and that this is distinct from the 
determinant for jumping (Figure 6D-E), which in ASG is loop 7 
(Figure S6). Of note, the NPM motif is predicted to be at the end 
of NTD heUx 6, which is a connection point between the NTD 
and the CTD domains [65,66]. Through amino acid sequence 
alignment we identified that A3D is the only other double domain 
AS deaminase to contain an NPM motif at the end of predicted 
helix 6, suggesting that A3D would also lack sliding movements 
while scanning ssDNA, similarly to A3F. Some specific residues 
within helix 6 have previously been shown to affect specific activity 
[34,66], possibly because of structural changes in the connection 
between the NTD and CTD that can affect the catalytic activity of 
the CTD or DNA binding affinity. Insertion of the NPM motif 
into ASG immediately after the predicted helix 6 ends (Figure 6A) 
did not cause a large disruption in structure based on CD spectra 
(data not shown), but did result in a ~S-fold decrease in specific 
activity (Table 4) and ~ 2-fold increase in binding affinity for 
ssDNA (Table 1). 
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To confirm a role of the NPM motif in blocking sliding 
movements we mutated this region in ASF to create an ASF NGM 
mutant with the hypothesis that removing the rigid proline residue 
would enable the enzyme to slide on ssDNA and deaminate closely 
spaced residues. Consistent with the hypothesis, closely spaced 
residues were processively deaminated by ASF NGM (Figure 7A- 
B). However, the ability to slide did not enable ASF NGM to 
induce high levels of mutagenesis similar to ASG in vitro (Figure 8) 
or in a single-cycle infectivity assay (Figure 9A and Table 5). This 
does not preclude that jumping and sliding are important for 
inducing mutagenesis in virus inftx tcd cells since the A3G NPM 
mutant that had decreased sliding restricted HIV similarly to ASF 
in single cycle infectivity assays (Figure 9A). Rather, these data 
indicated that the ability to slide and jump is necessary, but not 
suHicient to induce high levels of mutagenesis. The data supported 
the conclusion that the type of sliding and jumping movements, 
e.g., distance transversed was also important. Namely, we found 
that ASF processivity on ssDNA increased with increasing distance 
between deamination motifs, in contrast to ASG, demonstrating 
that the average jumping distance of ASF was larger than ASG 
(Figure 2A-B). This was confirmed with sequence analysis from 
the model HIV replication assay in which a larger number of 
deaminations were >20 nt apart for ASF than ASG (Table S). 
Thus, the ASF NGM mutant could slide, but was not truly a 
mimic of ASG. All together, it appears that the sliding and 
jumping mechanism of ASG is specifically optimal to induce a 
large number of deaminations during reverse transcription of 
DNA. 

An important note regarding the study of ASF is that we found 
N-terminally tagged GST-A3F was not processive (Figure S7), 
despite binding ssDNA with a of 46 ±4 nM. That the binding 
affinity of GST-ASF was more similar to ASF (Table 1, of 
20 nM) than ASF CTD (Table 1, Kj of 288 nM) indicated that 
the GST-ASF was able to bind ssDNA witii both NTD and CTD 
domains, despite a lack of processivity. This suggested that the 
GST tag caused steric hindrance on amino acid determinants for 
processivity in the NTD. Interestingly, we observed that non- 
processive ASF forms, both ASF CTD and GST-ASF, induced 
more mutations than wild-type, processive ASF (compare Figure 3 
and Figures S9 and SIO). We also found that ASA, which is largely 
nonprocessive, induced slightiy less mutations than ASG in the in 
vitro HIV repfication assay [44], but more than ASF. Although this 
initially seems difficult to reconcile, it is consistent with the overall 
hypothesis that processivity is related to mutagenic potential, since 
processive ASG is still the most efficient at inducing mutagenesis. It 
is only that a lack of processivity appears to be better than an 
"ineffective" processive enzyme such as ASF. This is not due to 
differences in the assay systems for characterizing processive 
deaminations on ssDNA oligonucleotides and the model HIV 
replication assay since addition of NC and RT to the ssDNA 
oligonucleotides in a deamination reaction did not change our 
observations regarding ASF CTD processivity (Figure Sll). In 
comparison to the nonprocessive ASF CTD and GST-ASF, 
processive ASF leaves many potential deamination motifs 
unmodified (compare Figure S and Figures S9 and SIO). Although 
there is inefficiency in the GST-ASF and ASF CTD having to 
dissociate and reassociate with the substrate many times, the 
reassociations can be much closer to the previous dissociation 
resulting in a more thorough search of the DNA. For example, we 
found that in the model HIV replication assay, 61 % of ASF CTD- 
induced mutations were >20 nt apart in contrast to ASF where 
75% of induced mutations were >20 nt apart (Table 3 and data 
not shown). Since the HIV replication assay is not conducted 
under single hit conditions, the results emphasize the inefficiency 



of the searching mechanism used by ASF. Since the binding 
afiinity of ASF for ssDNA is tighter than ASG or ASF CTD 
(Table 1), it is conceivable that ASF may also have a lack of 

frequent movements or excursions on the ssDNA that contribute 
to the inefficient search for deamination motifs. However, 
resolution of this speculation awaits single-molecule analysis. In 
sum, the data demonstrated that the interactions of ASF with 
ssDNA are essentially detrimental to its ability to induce a high 
mutation frequency. 

Our data demonstrate two main points. First, the data provide a 
biochemical reason for the inefficiency with which ASF-induces 
mutagenesis of YilWAvif as observed in this report and by others 
[22,2S,24] by demonstrating that the processive scanning behavior 
of ASF is detrimental to its mutagenic potential. The data establish 
that a balanced sliding and jumping ssDNA scanning mechanism 
similar to ASG is required for the most efficient induction of HIV 
mutagenesis. Set:ondly, the data show that deamination of 
5'CCC/5'CC has more gene inactivating potential than 
5'TTC/5'TC providing an additional reason for less restriction 
of HIV by ASF than ASG, in agreement with previous reports 
[4,18,44]. The data does not preclude that ASF can effectively 
restrict HIV and is in agreement with studies showing that ASF 
can restrict HIV in multiple rounds of infection [24,61], but since 
the number of mutations induced has been correlated to HIV 
inactivation [45,67], the data support the interpretation that ASF 
inactivates HIV less efficiently than ASG in a single round of 
infection. 

Materials and Methods 

Protein expression and purification 

Recombinant baculovirus production for expression of GST- 
ASG, GST-ASF (NCBI Accession BC0S8808), GST-A3G CTD 
(amino acids 197-380), GST-ASF CTD (amino acids 195-373), 
GST-ASG NPM, GST-ASF NGM or GST-nuckxx.apsid protein 
(NC) in Sfi cells was carried out using the transfer vector pAcG2T 
(BD Biosciences), as previously described [S4,S5,54]. Site directed 
mutagenesis was used to create the ASG NPM and ASF NGM 
clones. Cloning primers for A3 enzymes and the site directed 
mutagenesis primers were obtained from Integrated DNA 
Technologies and are listed in Table SI. Sf) cells were infected 
with recombinant virus at a multiplicity of infection (MOI) of 1 , 
except for GST-ASF and GST-ASF CTD which were infected at 
an MOI of 2. Recombinant baculovirus infected Sfi cells were 
harvested after 72 h of infection. Cells were lysed in the presence 
of RNaseA and the proteins (ASG, ASG NPM, ASG CTD, and 
NC) were purified as described previously [54] to obtain protein 
that was cleaved from the GST tag and 95% pure. The ASF, ASF 
NGM, and ASF CTD enzymes were eluted from the glutathione- 
sepharose resin (GE Healthcare) with the GST tag, as previously 
described [35]. The samples were then treated with thrombin 
(Merck MiUipore; ASF and ASF NGM, 0.02 U/^iL; ASF CTD, 
0.10 U/nL) for 2-5 hours at 21°C to cleave the GST tag. A 
DEAE Fast Flow column (GE Healthcare) was then used to purify 
tiie ASF, ASF NGM, and ASF CTD from tiie GST tag and 
thrombin. The proteins were loaded in low salt buffer containing 
50 mM Tris pH 8.0, 50 mM NaCl, 10% glycerol, and 1 mM 
DTT. A linear gradient from 50 mM NaCl to 1 M NaCl was used 
to differentially elute the enzymes. The enzymes eluted at 
approximately 450 mM NaCl and were 90% pure. The SDS- 
PAGE gels of the purified AS enzymes are shown in Figure SI 2. 
Protein fractions were stored at -80°C. HIV RT (p66/p51) [68] 
was generously provided by Dr. Stuart F.J. Le Grice (NCI, 
National Institutes of Health). 
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Size exclusion chromatography 

The oligomerization state of A3 enzymes was determined by 
subjecting 10-15 |Xg of the purified enzymes to size exclusion 
chromatography using a 10 mL Superdex 200 (GE Healthcare) 
resin bed contained in a column with a 0.5 cm diameter and 
16 cm height. The running buffer used was 50 mM Tris pH 7.5 
and 200 mM NaCl. The Bio-Rad gel filtration standard set was 
used to generate a standard curve from which molecular masses 
and oligomerization states were calculated. 

Model HIV replication assay 

A3-induced mutagenesis of ssDNA during reverse transcription 
of an RNA template was measured using an in vitro assay, which 
models reverse transcription from an RNA template and second 
strand synthesis, and was performed as described previously [34] . 
Briefly, a synthetic (+)RNA is synthesized that contains a 
polypurine tract (PPT), 120 nt of the catalytic domain of the 
HIV protease (prot), and foc/^a (248 nt). The PPT is used as a 
primer for (+)DNA synthesis and enables synthesis of dsDNA. The 
/ac/^a serves as a reporter gene for mutations by blue/white 
screening. The HIV protease gene was obtained by PGR using 
clone p93TH253.3 obtained through the AIDS Research and 
Reference Reagent Program, Division of AIDS, NIAID, NIH 
from Dr. Feng Gao and Dr. Beatrice Hahn [69]. The RNA 
template (50 nM) was annealed to a 24 nt DNA primer [34] and 
incubated with NC (1.5 nM), RT (1.2 \xM), and dNTPs (500 nM) 
in RT buffer (50 mM Tris pH 7.4, 40 mM KCl, 10 mM MgGl^, 
1 mM DTT) in the presence or absence of 200 nM of A3G, A3F, 
A3G NPM or A3F NGM. Synthesized dsDNA was PGR amplified 
using Pfu Turbo Hotstart (Agilent Technologies) that can use 
uracils as a template with high fideUty. The amplicons were cloned 
into a pET-Blue vector backbone that would allow the experi- 
mentally synthesized lac^a to be used for a-complementation [34] . 
At least twenty-five mutated clones for each condition tested were 
analyzed. DNA sequencing was carried out at the National 
Research Gouncil of Ganada (Saskatoon, Saskatchewan). A t-test 
was used for statistical analysis of sequences. 

Deamination assays 

The ssDNA substrates were obtained from Tri-Link Biotech- 
nologies and are Usted in Table SI. Deaminations were detected 
by resolving Fluorescein (F)-labeled DNA that had been treated 
with Uracil DNA Glycosylase (New England Biolabs) and heated 
under alkaline conditions on a 10'X>, 16'/o, or 20% v/v denaturing 
polyacrylamide gel, as described previously [35] . The gel type was 
determined by fragment sizes produced by each substrate. 
Reactions were carried out under single hit conditions, i.e., 
<15% substrate usage [46], to ensure that a single ssDNA 
substrate was interacting with at most a single enzyme. Under 
these conditions, a processivity factor can be determined by 
comparing the total number of deaminations occurring at two sites 
on the same DNA substrat(; to a calculated theoretical value of the 
expected deaminations that would occur at those two sites if the 
deaminations were not processive (see reference [35]). In order to 
obtain substrate usage within this range under steady-state 
conditions, the enzyme and DNA concentration were varied 
based on tlu; (;nzyme sp(;cific activity. More ssDNA was used with 
A3G to ensure clear observation of all deamination bands despite 
the large preference for the 5'G. However, the data are not altered 
with ssDNA concentration (data not shown). For A3G and A3G 
NPM, 30, 40, or 100 nM enzyme was incubated with 300 or 
500 nM fluorescein (F)-labeled ssDNA. For A3F, A3F NGM, and 
A3F GTD, 100 nM enzyme was incubated witii 50 or 100 nM F- 
labeled ssDNA. For A3G GTD, 1000 nM enzyme was incubated 



with 500 nM F-labeled ssDNA. Reactions were incubated at 37°G 
for 1-50 min. Gel pictures were obtained using a Typhoon Trio 
(GE Healthcare) multipurpose scanner and analysis of integrated 
gel band intensities used ImageQuant software (GE Healthcare). 
The specific activity was calculated from single-hit condition 
reactions by determining the picomoles of substrate used per 
minute for a microgram of enzyme. 

Steady state rotational anisotropy assays 

Steady state fluorescence depolarization (rotational anisotropy) 
was used to measure enzyme-ssDNA binding affinities using the 
same F-labeled ssDNA substrates (with cytosines 63 nt apart) that 
were used for deamination reactions (Table 81). Reactions were 
60 nL and contained F4abeled ssDNA (10 nM) in RT buffer and 
A3G (0-650 nM), A3F (0-80 nM), A3F GTD (0-600 nM), A3G 
NPM (0-350 nM), or A3F NGM (0-650 nM) were titrated into 
the reaction. A QuantaMaster QrvI-4 spectrofluorometer (Photon 
Technology International) with a dual emission channel was used 
to collect data and calculate anisotropy. Measurements were made 
at 21°G. Samples were excited with vertically polarized light at 
495 nm (6 nm band pass) and vertical and horizontal emissions 
were measured at 520 nm (6 nm band pass). Apparent dissociation 
constants (K,j) were obtained by fitting to a sigmoidal curve using 
Sigma Plot 11.2 software. 

Single-cycle infectivity assay 

VSV-G pseudotyped HIV pNL4-3 viruses were produced 
by transfecting 3x10^ 293T cells per well in a 6-well plate with 
Qiagen Polyfect reagent. Specifically, transfections used 1100 ng 
of pHrVzlz;jr [70], which expresses an eGFP reporter gene and 
630 ng of pLTR-G (Addgene), which expresses the VSV-G 
protein, in the presence or absence of 220 ng of A3G, A3F or 
A3F NGM or 350 ng of A3G NPM in pcDNA3.1. The 
transfections used empty pcDNA3.1 to achieve equivalent 
amounts of DNA. The cotransfection molar ratio of A3 enzymes 
in pcDNA3.1 to tiie pNL4-3 Avifwa& 0.33:1 (A3G, A3F, or A3F 
NGM) or 0.59:1 (A3G NPM). The A3G (cat# 9952) and A3F (cat 
7^ 10100) expression plasmids were obtained from the NIH AIDS 
Reagent program with G-terminal tags. A stop codon was 
introduced immediately after the A3G or A3F coding sequence 
to enable expression of native A3 enzymes. The amino acid 
sequence of the A3G and A3F clones were identical to those used 
in biochemical assays. Subsequentiy, site directed mutagenesis was 
used to create the A3G NPM and ASF NGM clones. The site 
directed mutagenesis primers were obtained from Integrated DNA 
Technologies and are listed in Table SI. Sixteen hours after the 
transfection, the cells were washed with PBS and the medium 
replaced. Virus-containing supernatants were collected 48 hours 
after the media change and filtered through 0.22 Jim syringe 
filters. Virus was quantified by a p24 enzyme-linked immunosor- 
bent assay (QuickTiter Lentivirus Titer Kit, Gell Biolabs Inc.). 
Target 293T cells were infected at an MOI of 0.5 by spinoculation 
at 800 X g for 1 h in the presence of 8 |Xg/ml of polybrene [71]. 
Inft-ction levels in 293T cells was determined by flow cytometry by 
detecting eGFP fluorescence at 48 hours post infection and data 
were normalized to ^iUMAvif infections in the absence of A3 
enzymes. 

Sequencing of integrated proviral DNA 

Infected 293T cells were harvested after 48 h and the DNA was 
extracted using the Qiagen DNea.sy Blood and Tissue kit. DNA 
was treated with Dpnl (New England Biolabs) to remove possible 
contaminating plasmid DNA and the prot (nt 2280-2631) 
sequences were amplified by PGR using Phusion High Fidelity 
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Polymerase (New England Biolabs). Primers were obtained from 
Integrated DNA and are listed in Table SI. PGR products were 
purified and cloned with the Zero Blunt TOPO PGR cloning kit 

(Invitrogen). DNA sequencing was carried out at the National 
Research Council of Canada (Saskatoon, Saskatchewan). 

Quantitative immunoblotting 

The A3G and ASF enzymes were detected in cell lysates (40 |j,g 
total protein) and virions (130 ng of p24) used for single-cycle 
infectivity assays using antibodies to the native enzymes. For A3G 
we used the ApoG17 rabbit antiserum (Cat # 10082, NIH AIDS 
Reagent Program) and for A3F we used the C-18 polyclonal rabbit 
antibody (Gat # 1 1474, NIH AIDS Reagent Program). Loading 
controls for cell lysates (a-tubulin, Sigma) and virions (p24, Gat 
#3537, NIH AIDS Reagent Program) were detected using mouse 
monoclonal antibodies. Proteins of interest and loading controls 
were detected in parallel on the same gel by using the Licor/ 
Odyssey system (IRDye 680-labeled goat anti-rabbit secondary 
antibody and IRDye 800-labeled goat anti-mouse secondary 
antibody). Visualization with an Odyssey Infrared Imaging System 
(Licor) and analysis of bands with Odyssey software enabled 
intensities of bands to be determined. Analysis of a titration of 
purified A3G and ASF with their respective antibodies showed 
that ASF detection was 9-fold less sensitive than ASG detection at 
a 1/1000 antibody dilution. Further, doubling the amount of 
antibody to ASF (1/500) resulted in a doubling of the ASF 
detection sensitivity in comparison to the antibody to ASG (1/ 
1000). Therefore, an appropriate corrcitioii factor for the 
antibody dilution was used to adjust the integrated band intensities 
of ASF to enable comparison with ASG. Antibodies were used at a 
dilution of 1/1000 except for ASF or ASF NGM containing cell 
lysates which required a dilution of 1/500 for detection of ASF or 
ASF NGM. A t-test was used for statistical analysis. 

Supporting Information 

Figure SI Binding affinities of ASF and ASG for single- 
stranded (ss) DNA or double-stranded (ds) DNA. ASF and 

ASG binding to fluorescein labeled DNA(10 nM) was monitored 
with rotational anisotropy. (A) ssDNA as shown in Figure ID was 
used as a substrate. ASF binds this ssDNA with a high affinity 

(apparent K,, of 20 ±1 nM). (B-C) The double stranded region 
(20 nt) created in Figure IE was used as a binding substrate for (B) 
ASF or (C) ASG. (B) ASF was unable to bind the dsDNA to 
saturation in a concentration range similar to ssDNA. We were 
unable to concentrate ASF sufficientiy to titrate in the necessary 
amount to saturate the dsDNA substrate. The apparent K,i is 
estimated to be >600 nM. (C) A saturation curve for ASG binding 
to dsDNA is shown for comparison. ASG binds the dsDNA with 
an apparent of 82S±11 nM. Values are an average from at 
least two independent experiments. 
(TIF) 

Figure S2 Processivity of ASF, ASG, and ASG NPM in the 
presence of a 20 nt RNA/DNA hybrid. Deamination was 
tested on a 118 nt ssDNA substrate that contained an internal 
fluorescein (E^-label and two deamination motifs separated by 

63 nt ''sketch). A 20 nt complementary RNA was annealed 
between the two deamination motifs. Single deaminations of the 
5'C and 3'G are detected as the appearance of labeled 100- and 
81- nt fragments, respectively; double deamination of both C 
residues on the same molecule results in a 63 nt labeled 
fragment (5'C & 3'C). (A) ASF, (B) ASG, and (C) ASG NPM are 
able to processively deaminate the target cytosines by transver- 
sing the RNA/DNA hybrid region. ASF is 2-fold more 



processive than ASG and ASG NPM on this substrate. The 
measurements of processivity (Processivity factor) and the 
Standard Deviation of the mean (S.D.) are shown below the 
gel. The ASF: DNA ratio was 2:1 and the ASG: DNA and ASG 
NPM: DNA ratios were 1:20. Enzyme: DNA ratios were varied 
due to different specific activities of the enzyme on a given DNA 
substrate. Values are an average from three independent 
experiments. 
(TIF) 

Figure S3 Analysis of ASF processivity in the presence of 
a 20 nt dsDNA region and 5'ATC deamination motifs. 

Deamination was tested on a 118 nt ssDNA substrate that 
contained an internal fluorescein (F)-label and two 5'ATC 
deamination motifs separated by 6S nt (sketch). A 20 nt 
complementary DNA was annealed between the two deamination 
motifs. Single deaminations of the 5'G and S'G are detected as the 
appearance of labeled 100- and 81- nt fragments, respectively; 
double deamination of both C residues on the same molecule 
results in a 63 nt labeled fragment (5'C & 3'C). ASF is able to 
processively deaminate the target cytosines by transversing the 
dsDNA region. The measurements of processivity (Processivity 
factor) and the Standard Deviation of the mean (S.D.) are shown 
below the gel. The ASF: DNA ratio was 2:1. Values are an average 
from three independent experiments. 
(TIF) 

Figure S4 Processivity of ASF and ASG on a substrate 
with deamination motifs separated by 100 nt. Deamina- 
tion was tested on a 157 nt ssDNA substrate that contained an 
internal fluorescein (F)-label and either two 5'TTC (ASF) or 
5'CGG (ASG) deamination motifs (sketch). Single deaminations of 
the 5'G and S'G are detected as the appearance of labeled 137- 
and 120- nt fragments, respectively; double deamination of both C 
residues on the same molecule results in a 1 00 nt labeled fragment 
(5'C & S'G). ASF (left) and ASG (right) are able to processively 
deaminate the target cytosines. The measurements of processivity 
(Processivity factor) and the Standard Deviation of the mean (S.D.) 
are shown below the gel. The ASF: DNA ratio was 1:1 and the 
ASG: DNA ratio was 1:20. Values are an average from three 
independent experiments. 
(TIF) 

Figure S5 Increasing the total concentration of enzyme 
and substrate does not decrease the processivity of ASG. 

Deamination was tested on an 85 nt ssDNA substrate that 
contained an internal fluorescein (F)-label (yellow star) and two 
deamination motifs separated by 30 nt (sketch). Single deamina- 
tions of the 5'C and S'G are detected as the appearance of labeled 

67- and 48- nt fragments, respccti\-cly; double deamination of both 
C residues on the same molecule results in a SO nt labeled 
fragment (5'G & 3'C). The processivity of A3G was not 
significantiy changed when the enzyme: substrate (E:S) ratio 
(1:16) was kept constant, but reaction components increased (3: 
50 nM, 30: 500 nM, 60: 1000 nM). The measurements of 
processivity (Processivity factor) and the Standard Deviation of 
the mean (S.D.) are shown below the gel. Values are an average 
from three independent experiments. 
(TIF) 

Figure S6 Model of the N-terminal domain (NTD) of 

ASG. Model (grey) shows loop 7 and heUx 6 (both in red). The 
amino acids NPM were inserted at the end of predicted helix 6. 
Zinc atom is a dark grey sphere. The predicted model of ASG 
NTD was obtained by using the automated SWISS-MODEL 
program using the homologous ASG GTD (PDB: SIQS) structure 
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as a template. Figure was made using PyMOL (The PyMOL 
Molecular Graphics System, Version 1.5.0.5, Schrodinger, LLC). 
(TIF) 

Figure S7 GST-A3F is not processive. Deamination was 
tested on an 85 nt ssDNA substrate that contained an internal 
fluorescein (F)-label (yellow star) and two deamination motifs 

separated by 30 nt (sketch). Single deaminations of the 5'C and 
3'C are detected as the appearance of labeled 67- and 48- nt 
fragments, respectively; double deamination of both C residues on 
the same molecule results in a 30 nt labeled fragment (5'C & 3'C). 
GST-A3F is not processive on this substrate as evidenced by the 
absence of a double deamination band (5'C & 3'C, 30 nt). The 
A3F: DNA ratio was 1:1. A representative gel from three 
independent experiments is shown. 
(TIF) 

Figure S8 Representative eGFP sequences of integrated 

proviruses. Representative eGFP sequences from the single- 
cycle infectivity assay (Figure 9A) are shown. Mutations are in 
bold. Alignment was made using CLUSTAL W. 
(TIF) 

Figure S9 ASF CTD mutagenesis in a model HIV 
replication system. (A) Spectrum of mutations are plotted as 
the percentage of clones containing a mutation at a particular 
location (nt) in the 368 nt prot-lac^a. construct. (B-C) Analysis of 
the number of mutations induced by A3F-CTD in the (B) prot or 
(C) lac^x regions. 
(TIF) 

Figure SIO GST-A3F mutagenesis is comparable to ASF 
CTD in a model HIV replication system. (A) Spectrum of 
mutations are plotted as the percentage of clones containing a 
mutation at a particular location (nt) in the 368 nt prot-lac^OL 
construct. (B-C) Analysis of the number of mutations induced by 
GST-A3F in the (B) prot or (C) lacZ<x regions. 
(TIF) 

Figure Sll ASF CTD is not processive in the presence of 

NC and RT. Deamination was tested on an 118 nt ssDNA 
substrate that contained an internal fluorescein (F)-label (yeUow 
star) and two deamination motifs separated by 63 nt (sketch). 
Single deaminations of the 5'C and 3'C are detected as the 
appearance of labeled 100- and 81- nt fragments, respectively; 
double deamination of both C residues on the same molecule 
results in a 63 nt labeled fragment (5'C & 3'C). ASF CTD is 
unable to processively deaminate the target cytosines as evidenced 
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